A Comparison of Windowless and Window-Based Computational Association Measures as Predictors of Syntagmatic Human Associations
Authors
Abstract
Distance-based (windowless) word association measures have only very recently appeared in the NLP literature, and their performance compared to existing windowed or frequency-based measures is largely unknown. We conduct a large-scale empirical comparison of a variety of distance-based and frequency-based measures for the reproduction of syntagmatic human association norms. Overall, our results show an improvement in the predictive power of windowless over windowed measures. This provides support for some of the previously published theoretical advantages and makes windowless approaches a promising avenue to explore further. This study also serves as a first comparison of windowed methods across numerous human association datasets. During this comparison we also introduce some novel variations of window-based measures which perform as well as or better than established measures in the human association norm task.
Similar resources
Asymmetry in Corpus-Derived and Human Word Associations
We investigate asymmetry in corpus-derived and human word associations. Most prior work has studied paradigmatic relations, either derived from free association norms or from large corpora using measures of statistical association and semantic relatedness. By contrast, we investigate the syntagmatic relation between words in adjective-noun and noun-noun combinations and present a new experiment...
Statistics Series: Analysis of Contingency Tables 2 (Measures for Examining Association)
The P-Value cannot present a complete measure of association in medical studies considering the association between categorical variables. In such situations, measures are required to reveal the clinical importance of relation along with their statistical significance, as the effect size. This paper aims to introduce the measures of associations for categorical variables and inferences ab...
Co-Dispersion: A Windowless Approach to Lexical Association
We introduce an alternative approach to extracting word pair associations from corpora, based purely on surface distances in the text. We contrast it with the prevailing window-based co-occurrence model and show it to be more statistically robust and to disclose a broader selection of significant associative relationships owing largely to the property of scale-independence. In the process we pro...
Examining the Associations of Covid-19 Vaccine News Sources with the Intention of Changing Adherence to Covid-19 Preventive Health Measures: A Online-Based Study in the North of Iran
Background: Although the scientific literature has extensively discussed the impact of the media on people’s health-related behaviors, there is little evidence on the effect of different sources of Covid-19 vaccine news on changing the intention to adhere to health protocols. Therefore, the present study was conducted to investigate the news sources of Covid vaccine 19 and the association of ea...
The Semantics of the Word Istikbar (Arrogance) in the Holy Quran based on Syntagmatic Relations (A Case Study of Semantic Proximity and Semantic Contrast)
The word istikbar (arrogance) is one of the key words in the monotheistic system of the Quran, which has found a special status as a special feature of the opponents and adversaries of the call to the truth. Given the prominent role of this issue in the human life system and its provision of corruption and moral deviations, it is necessary to represent the nature of the elements that make up th...
Publication date: 2009